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The effects of word frequency pn comprehension were investigated in relation - a 


: . . he . : ‘ Pg! 
to content type, vocabulary instruction, student sex andre stion type. 
120 sixth grade (competent) readers responded to three types of compre- 

: ~ w Lr. ; 

hension questions after reading two types of stories and receiving one of 
: - os ‘ 
three vocabulary treatments. Analysis of variance revealed that alt#ring 
word frequency did not affect comprehension, but altering question.type and 
content type did significantly afféct comprehension scores. Question type 
and’ content type interacted each otbet ‘and with ot er, factors, indicating 


the necessity of controlling these two factors in comprehension reSearch.: 


‘ 
e ‘ 
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‘ F = 
4 ” Word frequency has long been an finfluential variable in explaining 


and predicting the difficulty people/ have in processing written material. 
Its effect on visual duration cheeshld is a well-replicated finding 
(e.g. Solomon and Postman, 1952; Howes and Solomon, 1951): more frequent 


words are recognized at lower aeesbanaes its effect on children's word 


, aeouE HecanTen ability is a more recent finding (e.g. Pearson and Studt 


. 


(1975). Historically, it has pee included as a direct (e.g. the Sig 


Dale-Chall formula (1948)) or an indirect (e.g. using woitd length as a 
. e ‘ ® s 


surrogate as in Fry (1968)) factor in readability formulas, even though 
Klare (1974-75) reports that direkt manipulation of frequency has only 


occasionally produced effects on comprexenezon, 
/ s 
‘However, in direct conflict to Klare’ s @onclusion se the finding Of 


. 


Marks, Doctorow and Wittrock (1974) that altering the ‘word frequency te 


values of one-sixth of the running words in story length passages had 


< ai 


a significant effect on the comprehension of a students © ; ; : 
irrespective of reading ability. P : i i? 
What Mark$, Doctorow .and wiitwock ata was ,to divide the passages’ into ; 
; : ” A MN 4K, 
blocks of six words: Then for guetigté of the blocks, they Vocated equally : 
, Pa 4 - 


. 


r 


long (same number of letters) lower fréquency synonyms for higher 


frequency words in the text. In'the other half, they located higher 


frequency synonyms for lower frequency words. The net result was the crea- 
tion of two versions of each passage that differed only in thp frequency 
; : , 


valuesS of one-sixth of the words. Word length was constant across versions. 
By testing a mixture of literal and inferential multiple-choice questions 
oe a 


after each version, they determined that manipulating word frequency 


substantially altered comprehension when readability was held constant. 
They concluded that authors of text materials should heed their findings 


and consider the effect they might have if they searched for simpler 
77 
synonyms for some'of their more baffling lexical entries. 


r 


Several questions arise from their study. First, can one locate words 
of equal length but different frequencies that are -truly synonomous? 
It is possible that conceptual difficulty was not maintained between 
‘versions of a passage. We did not find this to be true, however. We asked 


° 


a group of ten graduate students to judge the synonymity of the synonym 


a 


pairs used by Marks et al. While the pairs were regarded as unacceptable 
out of context, they were judged acceptable in context. 

Second, did they really hold readability constant? On the face of 
it, any readabjlity formula that uses word ences as a factor in its 
regression equation would yield a positive answer. However, one: has to ask 
why word length is used as a factor ina readability formula. Historically, 
it arose as an éxtrapolatton from Zipf's (1945) Law that there is an inverse 
logarithmic relationship between word frequency and word length: less 
frequent words tend to be longer than more frequent words. If word frequency 


and word length were plotted on a graph using arithmetic (rather than 


logarithmic) increments on both axes, the scatter-plot would resemble a 


fat cigar more than a straight line. Hence it would 


: 


‘ ; 
yee 


increases. In short, it is possiblegthat Marks, Doctorow, and 
: —s : Soe a Fee ; . 
were taking advantage of the imperfect nature of the relationship 


frequency and length: By this ‘argument,;a readability formula that 


direct medsure of frequency ought to yield variant readability values 


their passages, which were equatedsfor length but varied in word frequency. 


v . 


This is exactly what we found when we applied the Dale-Chall{1948) formula . 
to their passages. For example their passage, "Conductor «*Moses". was rated, 


as 6.9 in the high frequency version but 10.5 in the’ low frequency version. 
; ie . 


Vikewise, "Bail-Out" was rated 7.9 in the high frequency version and 10.2°in 
the low frequency version. Over all five passages used by Marks, Doctorow 


and| Wittrock, the typical-différence between high and low frequency versions 
| : 
was| about three grade levels. 


Even so, the question of the influence of the frequency affect remains. 


All we have done is to show that readablakty was confounded in their 


a | . | 
experiment. The effect of word frequency is an important educational issue. 
et dhe. recommends that high frequency words be used whenever they fit 
the content, one very quickly gets into a logical paradox and an instructional 


x 


bind. By the logic of the high frequency rule, one would ultimately end up 


‘with a few well-known words to a ean the range of complexity of 


: , 
human experience. The instructional bind is, ofcourse, How gre students 


’ e 
@ ever to learn any new words if they are exposed only to familiar words. 


| 
| 


One way out of the bind is the common practice of vocabulary instruction; 


that is; we ought to, and in fact often do, preteach the difficult vocabu- 


et . ‘. ae 


‘ * lary in a passage, emphasizing both word meaning and word identification. 
. é ’ 


While vocabulary instruction is frequently recommended (e.g. Karlin, 1967; 


My > = 
Durkin, 1974), the method is often vague or unspecified. Sometimes. teacher 


. instruction is urged (McCullough, 1957; Gray and Holmes, 1938), and some- 


times self-instruction through the use of a glossary (Huus, 1963). Research 


is unclear regarding the direct effect of spec?fic vocabulary instruction, 
; oy hoe 


. ea 


but a recent study (Tuinman and Brady, 1973) suggests that it may have a 
generalized effect on vocabulary test scares but not on comprehension 


test scores. “What we need to know is whether or not students learn the 


vocabulary taught for a specifit selection and whether such learning has . a 


‘a‘positive effect on comprehension of selections in which this vocabulary \ 


is embedded. 


The present study was designed to investigate the relationship, between 


vocabulary instruction and the comprehension of passages differing in the 


average word frequency level. We reasoned that vocabulary instfuction 


e 


might have an effect in versions employing unfamiliar (infrequ¢nt) words 
but not in versidns employing familiar (frequent) words. 


In order to more precisely specify the conditions under which these 


effects might opérate, we included three other variables in the experiment. 


First, w&manipulated content type, employing one scientific exposition . 
and one historical narrative written in the biographical genre.: Second, 
we,varied question type over three levels varying from almost total textual 


reliance to minimal textual reliance, reasoning that vocabulary instruction , 
a ' . ‘ 
. might be more effective for those questions which were most textually 
. rs 
reliant. Finally we blocked subjects om-sex, male and female, reasoning that 
e . ‘ : : 
sex might interact with content type and responsiveness to ,ingtruction. 
: ‘ < p 


oS 7 ; a; . sd m aa ; *% 


~For this last, consideration, there is-some evidence to suggest an - . 
4 ) . - 


_-interaction between‘ content type and sex. Girls appear to have somewhat 
: ‘ s Ss P - 


different reading interests than boys (Chui, 1973), and boys' comprehension 


&*% fs ; . 
scores appear to be-more affected by, the levelsof personal interest in the 


story (Asher and Markell, 1974). American males generally achieve higher 
g scores than females when the item cootent is scientific,™~mechanical, * . 


business, practical affairs, or mathematicalfwhile females achieve higher 


ws ., XX 
scores than males on verbal tests when the content is human relations or 


the arts and humanities (Coffman, 1961; Donlon and Angoff, 1973). . 


“ : . : 
) In terms“of possible experimental effects, we hypothesized that ‘ 


y irrespective of the main effects for word frequency or vocabulary instruction, 


. . x 
there would be a vocabulary instruction X frequency interaction such that 
' + ye " : ri 
‘vocabulary instruction would increase comprehension in passages written { 
- . oe 
with low but not high, frequency words., Second, we expected to find'a 


: 
vocabulary instruction X question type interaction, indicating that 

vocabulary instruction would be especially helpful for’ questions which were most 
textually reliant. Finally we expected that sex would interact with content 


t type in, accordance with previous findings regarding male and female interests. ( 
4 


for different, types of reading material. ° : . i 


qn 


’ Method : : 
suk S = 8 ’ a, y 
esign. ‘ 
vevBe is .  . ; 
2 ' ; 
‘There were three between-subject variables: sex (male or female) 


word frequency (high or low)*and vocabulary instruction (none, teacher 


: directed jor glossary). Content type (technical exposition or historical 


, 


narrative) and question type (literal, infetential, or scriptal) were within 


subject, variables. Equal numbers, of male and female students were 


randamly assigned to each of the six word: frequency X vocabulary. instruction 
; y iY 

. . . Peas. 3 . Ps aye 
treatment.conditions. All students answered five of each of the three 

types of questions following each of the two types of content. The data 


were analyzed using a three-between, two-within analysis of variance 


(Winer, 1971). Effects significant beyond A = .01 were regarded as 


significant. This level was chosen because with such a large number of 


F ‘tests, we were more «concerned with, Type I than Type II error. 


\ . 


Sub jects ae 


. 


Subjects were 120 sixth grade stydents attending two elementary schools 
J re Let 3 ; } g 


‘ 


in a middle class suburb of Minneapolis. Pe subjects were selected by 
: . ! 


’ 


participating teachers after being given instructions to chaose sixth 


\ 


graders who read at or above grade level (on the basis! of standardized 


test scores). Initially, 145 students wére selected. From that group, 


: « 
six treatment groups of 20 students each were randomly selected, with the 


provision that each group contained 10 maTes and, 10 females. 


‘ - 


Materials 


Content Type. Two modified stories (also used in the Marks et al. 
study) selected from the SRA Réading Laboratories, served as the experimental 


reading materials. "Conductor Moses" is a narrative, biographical account 
a 


of Harriet Tubman's attempts to teScue slaves. 1"Bail Out" is a technical 


‘account of a “pave forced to bail ouk gee hfs jet ak it caught fire., 


An SFI index of 50 (Carroll, Davies,.and Richadr, 1971) was used as 


the cut off point between high and low frequency words.! The stories were 


divided into blocks of six words. In approximately one-half the blocks, a 


aoe synonym’ was substituted for:a low SERqueNey word’ of equal 


ength;,in the other half, a low for a “high frequency word? By grouping 


together the resulting high and low frequency blocks, two versions of each 


story were created differing only in the frequency, of one-sixth of .the 


words. 


rf 
/ 


Questiohs. For each story, 15 multiple choice questions, each with 


four choices, were generated, five for each questiog type. Litera) questions 


were generated by performing a whskcanetormakion on a specific text sqgment 
so as to replace an immediate st sae of the segment with the wh-word 


\ 


(Bormuth, Manning, Carr and Pearson, 1970). For example, "Harriet Tubman 
helped the slaves escape to Canada'"' > "Where did Harriet Tubman belp 


the slaves escape to?" 


& 


- Tee intecéose Mingmrenans questions: were geiucaned. by identifying 
a relationship betwedr ts grammatically un¢onnécted text ‘segments and, 
asking a question which required their logical ‘or pragmatic integration. 
‘For. example, the text, segment, "They walked tdose enough together so, that 


they could touch each other in the dark. It wouldn't do for anyone td 


cabl out if he got lost from the party," might generate the text inference 


so that ‘they could \ 
e ae ‘ 
touch #ach other in the dark." In general for text inference questions, 


question, "Why’ did they walk close enough together 


‘both thé question and thé “answer are in the text but the question-answer 


relationship is not obvious or grammatically cited. 
a. : r| = 2 
¥ Scriptal guestions were even less textually reliant: Intuitively, 


. ‘ 


they require a reading of the text, to get into the right mental set 

(script (Schank, 1973) or schema (Anderson, 1977) if youprefer) to answer 
the question, but the answer must come from prior knowledge It. is just not 
available in the text. For example, the question, "When did Harriet’ Tubman 
live?" could be answered from a knowledge of American history .and the 


Civil War. Nowhere in the text is even an approximate date given. In 


7 short, the question "came from" the text, but the answer had to come from 
( . 
_ prior knowledge. 


One other important feature of questions. Each was written in such 


é 
a way that at least one word in the item was a third synonym substitute 
4 ‘ 


, . 
for a word that been varied between the two versions (high and low 


' 
ae 


frequency) of each story. For example if "afraid" appeared in the high 


frequency version and "apprehensive" -in the low frequency version, then 


‘ . 
Yanxious" appeared in the,questign. These so-called-neutral synodyms 


were selected such that they possessed SFI walues midway between the SFI 


v 
. 


vatues @f the high and low frequency synonyms. This was.done in order to 
” ° “ af . . 


. . a 


ie 


<3 make certain that the question probes tapped text segments which differed A 
2 between versions and to elimi tdce the likelihood that the questions ®&ere 
2 ” . ‘ . rs a 
in any way biased toward one version or the other. ‘ P 
ft 


Once all the questions had been generated, a single random order for 


each set of 15 questions was determined and used for all subjects.in all 


‘ @ 

: treatment conditions. The reliability of the total test (KR20) was .75, 
thus falTing in between Downie's (1967) recommendations for standardized s 
tests (.90) and classroom 'tésts (.50). . . 


, Glossaries. Since one of the vocabulary pre-instruetidn.treatments 


x x 


; involved the use of a glossary, we created two glossaries 


(one high frequency and one low frequency) for each story. Each glossary 


~ 


contained 15’ vocabulary items that seemed essential to an undetstanding 


of the story and were directly related to the questions. 


In the glossary, each word was presénted, defined and used in a 


sample sentence. For example: {''Afraid. gfraid means anxious or appre- 

hensive. For example, the girY’was afraid of flunking the test." This 

ts an example from the high requency glossary. For the student reading , * 
‘ ., the lew Srauteney glossary, -the -_ sea read as follows: "Apprehensive. 


Apprehensive means-anxious or afraid. For example, the girl was 


. \apprehensive, about flunking the test." Each version of the glossary 


taught from the high frequency or low frequency version to the other two 


synonyms. In other words, the high frequency version presented the 
a % , . 
neutral (in the question) and low frequency synonyms, while the low frequency 
. ‘ . 
version presented the neutral and hon frequency synonyms: ~ 
~ © Se 


, 


Procedure : a 3 


Students were tested in groups of twenty (ten males-and ten females), 


for, a thirty minute period. The first two- groups (one high and one low 


. s 
° 


10 


>? — received no vacabulary instruction. They were given a simple 

expttation of ene purpose of the study and handed the first story to 

? read. Each child,. when finished with the story, raised his or her hand, 

- Fefurned the story to the experimenter, and received the. fifteen multiple 
choice questions for that story. The same procedure was repeated with 

the ieetba story. Story, order was randomized within each treatment group, © 


s 
The second two groups received vocabulary instruction in the form of 


; the Hiand-out sii. The experimenter handed each student a one page 
gives along with beck story, with the iodine instructions: "You 
will probably want to read the definitions before you read the stipes.” 
‘The third two groups received direct sccmiabaey instruction by the 


experimenter before they read each stoty. They were told by the exper- 
imenter: "I'm going to go over some words in the sede which might ‘give 

you trouble. This will probably help you answer the questions at the ait 
of the story."' The experimenter then Sent reviewed the same words on 

the previous glossary list, presenting exactly the same definition for % 
each vérd. The difference vas that this time the childten heardeach 

\ 

definition, but did not have the oppor bunt hy to read it. Under no conditions 
did the pilin have the opportunity to refer to the text while answering 


. : « 
questions, : > 


Las ‘a 


: are ; Results 


- OF the: five main effects, only those waascieul with the within-subjects 
variables vendtead significance. The content type effect, F (1,108) = 
46.513 p<, indicated that the peteutitie exposition (x = 10.45) was 
more difficult than the historical narrative (% = 11.98). There was 


considerably variation among question types, with scriptal questions proving 


ae eee —¢ ul 
the easiest (x = 8.28), followed by literal (x = 7.75) and text inference 
questions (x = 6,40), F (2,216) = 70.34, p<.ol. However, both of these 
¢ . 


variables were involved in higher order interactions, 
v 


The lack of a student sex effect was masked by a cross over interaction 
between sex'and content type, F (1,108) = 9.73, p<.Ol, Girls (X,= 12.28) 
surpassed boys (x = 11.67) on the historical narrative, while boys (x = 10.82) 
surpassed girls (x = 10.08) on the technical exposition. 


\ 


Purthermore, the two significant main effects, content type and question 


type, also interacted, F (2,216) = 23.35,p,<..01. The interaction appears, in large 


‘"‘measure, to be due to the differentia], effect of literal questions between 


content types. Whereas -the difference between inference questions from, 
ne version to another (x diff = .28) is of the same oe aes as the differ- 
. . . 


ence for scriptal questions (x diff = .14), the same difference for literal 


Pic sueettbns was much larger (xX diff = 1.13). In short, the techiical 


exposition was a bit harder than the historical narrative for scriptal and 


’ 


text-inference questions, but it)proved much harder for the literal questions. 
* ; In fact most of the main afiect gt content type is acectbutable to the . 
Particularly depressing effect bf ijteral questions for the technical 2 
exposition. : ; : es j 
Of the sixteen three and four way interaction tests, one proved to 
be significant: vocabular¥*x, conteft x qilectda type, F (4,216) = 3.11, 
p<0l. The sheer power of a tent with that mae degrees of~freedom should ; 
elicit eeutbon in interpreting the interaction. | Interestingly it is the 
; only point in the statistical analysis where vokshulges instruction emerg- 
“a . a , af 


ed assignificant fa¢tor: : . — ‘ 
\ P - . . 


ye" . 
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12 
” 
About all the interaction-analysis merits is to say that relative to 
no instruction or glossary instruction, teacher instructioh was particularly 
facilttating for inference questions in the technical exposition. In fact, 
if one compares the nine compgrable question type X vocabulary instruction 


means across the.two content types, there is only one case in which a 


; $ 

technical exppsition mean was greater than an historical narrative mean, 
NS . 

i.e. for teacher instruction (xqy = 2.95, XTE = 3.43). 


Discussion 
' a i | 
Few of the experimental hypotheses were verified. First, we were hot 
; ; 


able to replicate the Marks, Doctorow and Wittrock findings. High frequency 


‘Y¥ 


versions (x = 11.34) wére no more difficult than low frequency versions ‘ 
(x = 11.07), F (1,108) = .510, p. > .5. _Granted we crossed frequency with ? 
sex and vocabulary instruction between subjects. Yet: none of the two 

three way interactions involving those three variables approached signi- 

ficance (the largest F ratio was 1.74 for the three way “{nteractipn). 

Furthermore, looking at the simple effects within the no iveidicn condition 
summed acconk sex, there is no hint of an effect for - frequency. The only 

systematic difference between the Marks et 41. study and the; pleséat study 


was question type. Questions in the present study were written “in a 

: whe . 
neutral language based upon text segments containing words whose lexical 
form varied between high and low fraquency versions. Apparently this* 


neutrality was sufficient to destroy-the frequency effect. However,‘we have 


perused the questions used by Marks et al. and we find no systematic bias 


‘that could account for such. variant findings. Even -so, the results 6f the 


D 7 , : 
present study are congruent with a long line of research suggesting that - 


frequency manipulations by themselves gre not sufficiently powerful to 


affect comprehension (Klare, 1974-75). 


o ~ 


wu 19 ms 
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’ 
Second, we were not able to demonstrate de effect for vocabulary 

instruction at all, let alone one exc sively for low frequency stories 

pr particular question types, Here, hobever, we are forced to admit to a“ 
methodological weakness: the vocabulary instruction in this atudy was 
neither intensive, systematic .nor complete. Nor did we assess prior know- 
reage. It is possible that these students (sixth grdde students reading at 
grade level) knew al} er ee low frequency words, although this is 
unlikely in view of the 10-11 grade level readability values. The only shred 
$f support for wovabulany intervention gomes from the sarend type .X question 


type X vocabulary instructfon interaction. It seems to say that if you have 


' 


technical material and difficyle text integration questions, it helps to 

have a,teacher present the wore eention for understanding the questions, 
What is enadad is a’ ebeonger test of vocabulary distedetion-<some tntenstee 
concept building--in situations where one is sure the concepts are unfamiliar. 
On the, other .hand, che wank tests of pocahulaey ftervention is the present 


study may be more typical of what actual happens in classrooms. . 


The sex X content interaction is consistent with the research on sex 


differences and content (Coffman, 1961; Donlon and Anghoff, 1973), suggesting 


that boys are more interested in technical matters while girls are more 
‘ é 
interested in family relationships. However, the present data are not 


congruent with Asher and Maxkell's (1974) conclusion dae givis are more 
tolerant of shifts away from personally interesting material. 

The question typé variable was originally included in order to determine 
whether or not some of the more central varteblhs produced effects generaliz- 


able across different kinds of comprehenSion.probes. It turned out to be the 


most potent variable in the study, and the results are somewhat counter-intuitive. 
: e 


| . 


As educators we tend to regard factual recognition items as representing 
the lowest» (and hence the easiest) level of conprahenston (cf. Bloom, 1956; 
Barrett, 1968). Yet scriptal questions, which require logical gontacttona 
between textual-information and internation stored in memory, were at least 


. ‘ 
as easy as literal questions, and they were definitely easier for the technical 


\ - 
exposition (i,e., the content type X question type interaction). >» 


é 
Perhaps more surprising is the low level of performance for text inference 


questions. Remember these questigns required the reader to. make a logical 
Or pragmatic (statistical or probabilistic) connection between two textually 
presented segments of text (prepositiens). 


In relationship to what gets into long term. memory storage the question 


is, Why should it be more difficult to ptocess two textual propositions and 
draw some connection between them than it: is to process and store as atceie 
textual proposition (literal) or to relate a single text proposition to a 
proposition already in long term storage (scriptal)? 

If one hypothesizes that memory for propositions is a function df 
how well they fit with pre-existing schema (e.g. Anderson, 1977, or. 
Bower, 1976) the present data are more reasonable. If schematic knowledge 
is strong, as apparently it was for things like syaptoms of periods of 
history or most likgly mativations for human actions (the kinds of content 
tapped by scriptal questions), propositions are easily assimilated into 
memory tand readily available for retrieval.. This Le vatlected in the higher 
scores for scriptal questions. When schematic knowledge is weak (as it 
probably was for the very specific factual propositions in both the literal 
and text inference items), propositions are less likely to remain te storage. 


Furthermore, wher the relationship between two propositions is not explicit 


(e.g. ‘in the text inference questions) it is possible that the two proposi-~ 


‘ f 


ue 


/ 


tions get into memory storage but they get into different schemata. Hence, 


' - 


N 


retrieval. of a relationship between, them is more difficult. 
( 


If this explanation is accurate, then comprehension for text inference 


questions should be improved relative to comprehensién for literal and * 
’ 

‘scriptal questions if subjects are allowed free access to th® text while 
. - ‘ 2 | 


answering questions, i.e. when subjects are allowed to actively seagth 


® 
the text for such relationships. There should be almost no improvement for 


’ ’ 
scriptal questions gince they are as reliant on schemata as they are on text, 


Comprehension for literal ‘questions should be improved by an intermediate 


amount. ; 


As a follow-up, we gave the low frequency--no vocabulary instruction-- 
a . ; \ 
versions of as two stories to a group of fourteen subjects from the same 


population, allowing them free access to the-text while answeting the questions. 
‘s . . , . e 


“We contrasted this new gtoup (12 of 14 were girls)-Jith the comparable ‘no 


access--low frequency--no vocabulary instruction female group (N=10). We 


, es 


found partial support for this explanation. The difference. between access 


, 
bd . - 


and no access groups was significant for inference questions (Xdiff = 2.37), 


. 


. ‘ 
t(22) = 4.23, p. <.01, but not significant for literal (xdifg = .51J or . 


scriptal (xqiff = BB) qiaettons. ‘ Lie ee 

~ White such dk-emphention is targely conjecture, it is loosely 
consistent with the relative difficulty of question eet 40 the m¥in study 
and quite consistent with the findings in the follow-up study. And the 
notion of schematic assimilation-continues to gather ewpiricdl sipport 

. F ‘ - 

(Anderson, 1976; Bower, 1976; Thorndyke, 1977). Unfortunately, schematic 
knowledge was not assesséd or ‘manipulated independently in the present 


investigation. Hence the conjectwre label must stand until more appropriate 


research is conducted, 


%, 
The limitations of this study deserve note. First, a stronger test of 


the vocabulary instruction variable could have been provided. Second, 


even though the’ passages were lengthy, one passage each of the two content 
types hardly constitutes a sample from some population of discourse one 
might like to- generalize to. Third, inferences about content type are Jimited 


by the fact that pretagonist sex, topic, amd style were. completely confounded. 


One was a historical narrative with a female protagonist; the other, a 


scientific exposition with a male protagonist. The confounding was ;not 
- ’ 


accidental. We chdse to maximize the likelihood that content type would 


operate as an aa factor. It remains for future research to sort 


out the relative impact of the three variables. Fourth, we chose a father 


restricted population of subjects. With less able readers, it is not 
? ¢ 


unreasonable to expect shifts in performancé& across question types or 


e 


frequency levels. 2s 
Perhaps the most important contribution of the present study is that 
it establishes the necessity of examining comprehension asa multi-level 


factor incorporating both mind and (reading) matter. 


Footnotes F P = 
1. The SFI scale is logarithmic: SFI = 40 corresponds to 100 occurrences, 
per million, SFI = 50, 10; SFI = 60, 1. It was not altogether clear 
what Marks, Doctorow and Wittrock had done; however by examining a 


sample of 70 substitutions, werinteegss that they must have used 
. SFI =-50 as a cut-off. i : 


+ 


2. We tried to parallel Marks et al. as closely as possible, They 
indicated that they tried to substitute "the word most amenable to 
substitution." We adopted a rule that we would try to substitute 
"the first lexical token for which we could find a synonym that 
met the high/low frequency criterion." In practice, our rule did 
not differ significantly from.that used by Marks et al. 

ei 


he 
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